Ensemble Genetic Programming for Classifying Gene Expression Data
نویسندگان
چکیده
Ensemble is a representative technique for improving classification performance by combining a set of classifiers. It is required to maintain the diversity among base classifiers for effective ensemble. Conventional ensemble approaches construct various classifiers by estimating the similarity on the output patterns of them, and combine them with several fusion methods. Since they measure the similarity indirectly, it is restricted to evaluate the precise diversity among base classifiers. In this paper, we propose an ensemble method that estimates the similarity between classification rules by matching in representation-level. A set of comprehensive and precise rules is obtained by genetic programming. After evaluating the diversity, a fusion method makes the final decision with a subset of diverse classification rules. The proposed method is applied to cancer classificaiton using gene expression profiles, which requires high accuracy and reliability. Especially, the experiments on popular cancer datasets have demonstrated the usefulness of the proposed method.
منابع مشابه
The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming
OBJECT The classification of cancer based on gene expression data is one of the most important procedures in bioinformatics. In order to obtain highly accurate results, ensemble approaches have been applied when classifying DNA microarray data. Diversity is very important in these ensemble approaches, but it is difficult to apply conventional diversity measures when there are only a few trainin...
متن کاملGenetic Programming Based Formulation to Predict Compressive Strength of High Strength Concrete
This study introduces, two models based on Gene Expression Programming (GEP) to predict compressive strength of high strength concrete (HSC). Composition of HSC was assumed simplified, as a mixture of six components (cement, silica fume, super-plastisizer, water, fine aggregate and coarse aggregate). The 28-day compressive strength value was considered the target of the prediction. Data on 159...
متن کاملCancer Prediction Using Diversity-Based Ensemble Genetic Programming
Combining a set of classifiers has often been exploited to improve the classification performance. Accurate as well as diverse base classifiers are prerequisite to construct a good ensemble classifier. Therefore, estimating diversity among classifiers has been widely investigated. This paper presents an ensemble approach that combines a set of diverse rules obtained by genetic programming. Gene...
متن کاملDecision Support Systems Using Ensemble Genetic Programming
This paper proposes a decision support system for tactical air combat environment using a combination of unsupervised learning for clustering the data and an ensemble of three well-known genetic programming techniques to classify the different decision regions accurately. The genetic programming techniques used are: Linear Genetic programming (LGP), Multi-Expression Programming (MEP) and Gene E...
متن کاملGene Expression Classification using Multi-Objective Ensembles
The classification of microarray data is an important application of computational intelligence techniques. It is important for the techniques chosen to construct models that are accurate, parsimonious and explicable. This paper investigates the use of multi-objective genetic algorithms to create a number of rules for classifying gene expression data and the combination of these rules into an e...
متن کامل